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Economic  Aspects  of  Optimal  Disability  Benefits 
P.  Diamond  and  E.  Sheshinski* 

1.   Introduction 

The  US  social  security  system  began  with  retirement  benefits  but  no 
disability  benefits.   When  disability  insurance  was  added  to  the  system 
(1956) ,  the  formula  for  retirement  benefits  was  adapted  to  determine  disabili- 
ty benefits.   With  the  introduction  of  early  retirement  benefits  at  age  62 
(1956  for  women,  1961  for  men) ,  there  has  been  a  three  year  overlap  when 
workers  could  apply  for  either  retirement  or  disability  benefits.   Economists 
have  been  studying  the  relationship  between  labor  supply  of  older  workers  and 
the  availability  of  retirement  benefits.-^   Economists  have  also  studied  the 
optimal  design  of  retirement  benefits  in  recognition  of  the  effect  of  benefits 
on  labor  supply.^  While  it  has  been  recognized  that  disability  benefits  also 
affect  labor  supply,^  we  are  unaware  of  any  studies  of  the  optimal  design  of 
disability  and  retirement  benefits  in  recognition  of  the  effect  of  benefits  on 
labor  supply.*  This  paper  begins  such  a  study,  using  a  simple  static  model 
to  analyze  optimal  levels  of  disability  and  welfare  (or  retirement)  benefits 


'Research  supported  by  the  National  Science  Foundation  and  the  Binational 
Science  Foundation.   Research  assistance  done  by  Douglas  Galbi. 

^Diamond  and  Hausman  (1984),  Burtless  and  Moffitt  (1984),  and  Gustman  and 
Steinmeier  (1985). 

^Diamond  and  Mirrlees  (1978),  (1982),  (1986). 

^Halpern  and  Hausman  (1986),  Leonard  (1986). 

''The  paper  by  N.  Stern  (1982)  can  be  interpreted  as  contrasting  welfare 
benefit  and  disability  programs.  For  another  analysis  of  disability  insurance 
that  recognizes  imperfect  measurement  of  disability,  see  D.  Parsons  (1990).  See 
also  G.  Akerlof  (1978)  for  optimal  benefits  with  one  type  of  classification 
error,  but  not  both  types.  For  a  general  discussion  of  disability  programs  in 
eight  countries,  see  Haveman,  Halberstadt,  and  Burkhauser  (1984). 
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with  recognition  of  the  imperfect  nature  of  disability  evaluation. 

We  begin  with  a  discussion  of  the  problems  of  modeling  disability  and 
disability  screening  mechanisms.   We  next  analyze  the  optimal  structure  of 
benefits  for  a  given  disability  screening  mechanism  in  a  model  with  a  continu- 
ous distribution  of  disability  levels.   We  then  specialize  the  model  to  the 
case  of  only  two  disability  levels  to  simplify  the  analysis  of  comparative 
statics  and  to  examine  optimal  disability  standards  and  benefits  levels.   We 
return  to  the  continuous  model  for  additional  analysis  of  optimal  standards 
and  benefits.   We  conclude  with  a  logarithmic  example  and  suggestions  for 
further  research. 
2.  Modelling  Disability 

At  any  time  there  is  a  vast  array  of  actual  and  potential  marginal 
products  among  the  employed  and  not-employed  individuals  in  the  economy. 
Similarly,  there  is  a  vast  spread  in  the  difficulties  or  disutilities  associ- 
ated with  working.   For  some,  the  alternative  source  of  income  should  they  not 
work  is  disability  income  rather  than  retirement  income  or  welfare  benefits. 
In  principle,  social  security  disability  benefits  are  available  for  those 
unable  to  find  any  remunerative  employment  as  a  consequence  of  their  disabili- 
ties.  Yet,  some  people  with  considerable  handicaps  are  employed,  while  others 
with  seemingly  similar  handicaps  are  receiving  benefits.   While  eligibility 
for  benefits  is  straightforward  for  some  handicaps,  with  others  it  is  diffi- 
cult to  determine.   For  some  handicaps  it  is  hard  for  any  outsider  to  judge 
the  ability  to  work  or  its  difficulty. 

Our  problem  is  to  model  this  complex  picture  in  a  way  that  is  helpful 
for  the  analysis  of  economic  variables  such  as  benefit  levels.   Since  we  will 
analyze  a  static  model,  we  will  not  explicitly  model  individual  histories 


3 
(including,  e.g.,  industrial  accidents)  which  may  shed  light  on  abilities  to 
work.   We  have  identified  two  separate  aspects  of  working  -  the  ability  to 
carry  on  economically  useful  activities  and  the  difficulty  (including  physical 
pain)  in  carrying  them  out.   For  convenience,  we  will  work  with  a  single 
variable  by  assuming  that  all  individuals  have  the  same  marginal  products , 
thus  ignoring  correlations  between  productivity  and  disability  incidence.   In 
contrast,  we  will  assume  that  each  individual  has  an  additive  disutility  of 
work,  with  disutilities  distributed  over  all  nonnegative  values.   For  the 
modeling  purposes  here,  it  makes  little  difference  whether  someone  is  totally 
unable  to  work  or  has  an  infinite  disutility  of  work.   We  further  simplify  the 
model  by  assuming  that  all  workers  have  the  same  (concave,  increasing)  utility 
of  consumption,  u(c) ,  while  all  nonworkers  have  the  same  (concave,  increasing) 
utility^  of  consumption  v(c) .   Thus  the  distribution  of  disutilities  plays  a 
central  role  in  determining  willingness  to  work.   We  take  work  to  be  a  0-1 
variable  and  do  not  model  varying  hours  or  work  intensity. 

Recognizing  the  problem  of  determining  disability,  we  assume  that 
disutility  is  not  perfectly  observable.   For  the  start,  we  assume  that  there 
is  a  disability  evaluation  mechanism  in  place. ^  When  someone  with  disutility 
0  is  evaluated  for  disability,  there  is  a  probability,  p(8)  that  the  individ- 
ual will  be  judged  disabled.^  Naturally,  we  assume  that  p(8)  is  increasing 
in  e  and,  for  notational  convenience,  that  p(0)  =  0.   In  Section  6  we  look 


^We  also  assume  that  both  u'  and  v'  go  from  oo  to  0  as  c  goes  from  0  to  +». 

^We  assume  that  the  evaluation  process  is  costless  for  both  the  applicant 
and  the  evaluator.   It  would  be  interesting  to  introduce  costs. 

^We  assume  that  p(9)  is  known  to  the  government  decision  makers.  Obviously, 
it  is  not  precisely  known,  but  one  can  make  reasonable  hypotheses  about  its  basic 
form.  This  paper  can  be  interpreted  as  a  mapping  from  beliefs  about  the  accuracy 
of  disability  evaluation  into  desired  benefit  levels.  For  some  evidence  on 
disability  evaluation,  see  S.  Nagi  (1969).  For  evidence  that  some  people  denied 
disability  benefits  work  while  others  do  not,  see  J.  Bound  (1989). 
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into  the  public  decisions  which  affect  the  shape  of  p(8).   These  decisions 
relate  the  sizes  of  type  I  and  type  II  errors  to  the  strictness  of  the 
disability  standard. 
3.   Continuous  Model 

We  consider  three  different  consumption  levels,  Cg  for  active  workers, 
Cjj  for  welfare  (or  early  retirement)  beneficiaries,  and  Cj  for  disabled 
beneficiaries.   The  utility  function  for  a  worker  with  labor  disutility  level 
6  is  written  u(Ca)  —  6 .   We  assume  that  6  is  nonnegative  and  distributed  in 
the  population  with  distribution  F(e)  (and  density  f(8)).   For  convenience, 
we  assume  that  f(6)  is  continuous  and  positive  for  all  nonnegative  values  of 
6 .   The  utility  function  for  nonworkers  is  v(c) ,  where  c  will  take  on  the 
values  Cb  or  c^  depending  on  whether  the  nonworker  is  receiving  welfare  or 
disability  benefits.   We  assume  that  disability  benefits  are  at  least  as  large 
as  welfare  benefits  (otherwise,  there  is  no  reason  to  have  a  separate  disabil- 
ity program  with  a  selective  screening  mechanism) . 

Cd  ^  Cb  (1) 

Given  the  utility  functions  and  the  consumption  levels  for  workers  and 
nonworkers,  we  can  analyze  which  individuals  would  prefer  working  to  receiving 
retirement  benefits  and  which  would  prefer  working  to  receiving  disability 
benefits  (if  eligible).   Those  with  disutilities  of  labor  below  the  threshold 
values  which  equate  utilities  will  choose  to  work.   Thus  the  threshold  values 
satisfy 

6b  =  Max[0,  u(c^)  -  v(Cb)] 

(2) 

6.  =  Max[0,  u(cj  -  v(Ch)] 
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Since  disability  benefits  are  greater  than  retirement  benefits  (1) ,  the 
threshold  labor  disutility  for  persons  who  apply  for  disability  benefits  is 
lower  than  the  threshold  for  those  who  apply  for  retirement  benefits.   Thus  we 
have : 

%  ^  Qi  (3) 

With  the  fraction  of  the  population  with  disutility  6  who  are  eligible  for 
disability  benefits  written  as  p(6),  individuals'  labor  supply  choices  and 
consumption  levels  are  as  shown  in  Figure  1. 

We  normalize  by  setting  the  marginal  product  of  a  worker  equal  to  one. 
We  can  now  state  the  problem  of  the  selection  of  consumption  levels  to 
maximize  social  welfare  as 


-a 

Maximize/"  [u(c3)  -SldF  (0) 

0 
Ob 

+  1  [p(e)v(Cd)  +  (i-p{e))  (u(cj  -e]dF{0) 
[  [p(e)v(Cd)  +  {i-p(e))v(Cb)]dF(0) 


(4a) 


+ 

Ob 


Subject   to: 


f  (c^-l)dF  +  j  [p(0)Cd  +   (l-p(0))  (c^-l)]dF 


0 

CO 

+J    [p(0)Cd  +    (l-p(0))cJdF  =  R 

Ob 


(4b) 


where  R  are  the  resources  available  to  the  economy.   We  first  discuss  the 
first  order  conditions  under  the  assumptions  that  6^  >  0  and  6^,  >   6^.   We  then 
state  sufficient  conditions  for  these  strict  inequalities  to  hold  at  the 


optimum. 

Using  Lagrange  multiplier  A  and  assuming  also  that  0(j  >  0,  we  can  state 
the  first  order  conditions  as  the  resource  constraint  (4b)  and  the  three 
equations 


[u'(cj  -X]  /  dF  +  [u'(cj  -X]  J  (l-p(0)dF 


+  A(c -i-cj  (i-p(0j)f  (ej 


a 


ac 


a 


=  X[(c^-1-Cd)p(0d)f  (ej  +  (c^-l-Cfc)  (l-p(eb)))f  (0t)]u^:',) 

5a) 


[v/{Cd)  -X]  Jp(0)dF  =  -X(c^-1-Cd)p(0d)f  (ed)v'(Cd)      (5b) 


[vaCb)-A.]  f  (l-p(0)dF  =  -X(c  -1-cJ  (l-p(0J)f  (0Jv/(c.)   (5c) 


L 


where  we  have  used  (2)  to  obtain  th  derivatives  of  61,  and  8^.   If,  at  the 
optimiim,  6(j  is  zero,  we  have  the  same   onditions  since  p(0)  =  0.   The  left 
sides  of  all  three  expressions  are  the  social  values  of  the  differences 
between  giving  consumption  to  the  agents  in  the  three  different  positions 
rather  than  holding  the  resources.   The  right  Sides  are  the  social  values  of 
the  resource  savings  from  the  induced  changes  in  labor  supply  as  a  consequence 
of  altered  benefits.   The  private  return  to  working  for  someone  j.neligible  for 
disability  benefits  is  Cg  -  Ct.   Comparing  this  to  the  marginal  product,  we 
see  that  there  is  an  implicit  tax  on  wortc  when  Cg  -  c^  <  1.   From  (5c),  we 
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will  have  an  implicit  tax  on  work  if,  at  the  optimum  v'{Cij)  >  X.   Below  we  will 
note  plausible  sufficient  conditions  for  this  conclusion. 

Dividing  the  expressions  in  (5)  by  the  marginal  utilities  on  the  right 
hand  side  and  adding,  we  see  that  the  inverse  of  the  Lagrangian  equals  the 
average  of  the  inverses  of  the  marginal  utilities  of  consumption 

|(u'(cJ)-MF  +  f   [(l-p(0))  (u/(cj)-i+p(e)  (v'(Cd))-^]dF 

0  Bh 

(6) 

+  f   [(l-pO))  (v/(Cb))-^  +  PO)  (v'(c<i))-i]dF  =  k-^ 

Ob 
We  turn  next  to  sufficient  conditions  for  the  solution  to  be  internal. 
The  question  of  when  it  is  desirable  to  have  some  work  in  the  economy  (81,  >  0) 
is  a  question  of  the  wealth  of  the  economy;  i.e.  the  size  of  R.   If  no  one 
works,  it  is  optimal  for  everyone  to  have  the  same  consumption  {c^  =   c^  =  R)  . 
This  allocation  will  not  be  optimal  if  those  with  the  least  disutility  of 
labor  would  choose  to  work  for  the  additional  consumption  equal  to  their 
marginal  products . 

u(R  +  l)  >  v(R)  (7) 

We  assume  that  (7)  is  satisfied  (implying  61,  >  0)  and  refer  to  it  as  the 
poverty  condition. 

We  turn  next  to  sufficient  conditions  for  disability  benefits  to  exceed 
welfare  benefits.   Consider  the  economy  with  a  single  benefit  program.   The 
level  of  benefits  is  chosen  relative  to  the  consumption  of  workers  to  balance 
the  effect  on  labor  supply  with  the  desire  to  allocate  consumption  efficiently 
over  workers  and  nonworkers .   Introducing  a  small  disability  program  permits 


higher  benefits  for  a  fraction  of  nonworkers ,  j  p(6)dF,  at  a  cost  of  the 


decrease  in  labor  supply  of  size  p(9j,)f  (0^)  .   Without  a  separate  disability 
program,  the  attempt  to  pay  more  to  all  nonworkers  raises  benefits  for  the 
l-FOb)  nonworkers  and  decreases  the  labor  supply  by  f  Ob)  .   Thus,  having 
p (6)  increasing  in  6  is  sufficient  to  make  a  disability  program  desirable 
provided  the  marginal  utility  of  consumption  of  nonworkers  exceeds  thaz  of 
workers  at  the  optimxam  without  a  disability  program.^ 

A  sufficient  condition  for  this  latter  relation  can  be  expressed  in 
terms  of  the  Diamond  and  Mirrlees  (1978)  moral  hazard  condition  that  equating 
utilities  between  nonworkers  and  workers  who  like  work  the  most  leaves 
marginal  utility  higher  for  nonworkers: 

u(x)  =  v{y)  implies  u'(x)  <  v'(y)  ^°^ 

By  (7)  there  is  some  work  at  the  optimal  allocation.   Thus  at  least  the  most 
able  worker  (8  =  0)  must  work,  implying  u(Ca)  >  vCc^)  .   And  so,  by  (8),  u' (Cg) 
<  V (cb)  .   While  it  is  not  so  plausible  that  (8)  holds  for  workers  wi  h  6=0, 
it  is  plausible  that  the  same  condition  does  hold  for  a  £;;fficient  range  of 
values  of  6  to  yield  the  condition  needed,  that,  at  the  optimum  u' (c^)  < 
v' (Cb)  .   We  assume  that  (8)  is  satisfied  and  conclude  that  Cj  >  Cb. 

From  the  equation  for  A,  (5),  we  then  have  A  <  v' (Cb) .   From  the  first 
order  condition,  (5c),  we  then  have  an  implicit  tax  on  work,  1  >  Cg  -  Cb.   It 
follows  that  1  >  Cg  -  Cd  and  so,  from  (5b)  X  ^  v'(c^)  ,  with  equality  if  6^  =  0. 


^To  see  this  formally,  add  (5b)  and  (5c)  under  the  condition  Cj,  =  c^  and  so 
6b  =  6d-  Tl^s  fact  that  p(9)  is  increasing  will  then  imply  that  the  left  hand 
side  of  (5b)  exceeds  the  right  hand  side  provided  that  v' (c^)  >  A. 


We  can  then  conclude  that  X   >  u'(Ca)  .   Thus  both  welfare  and  disability 
programs  have  benefit  levels  below  the  level  which  would  equate  the  marginal 
utilities  of  beneficiaries  with  that  of  workers. 
4.   Labor  Disutility  Observable 

We  have  used  the  realistic  assumption  that  labor  disutility  (or  disabil- 
ity) is  not  perfectly  observable.   It  may  assist  in  the  interpretation  of  the 
first  order  conditions  to  analyze  the  full  optimum  where  disability  is 
observable.   The  maximization  problem  in  (4)  is  changed  by  making  Gj,  and  6^ 
control  variables  rather  than  endogenously  determined  by  benefit  levels  in 
(2).   In  this  case,  there  is  no  reason  for  a  distinction  between  welfare  and 

disability  benefits  (ct  =  cj)  .   Consumption  is  optimally  allocated  to  equate 
marginal  utilities  of  consumption 

u'(c^)  =v'(c^)  (9) 

All  individuals  with  disutility  levels  below  some  cutoff  6*  should  work.   The 
cutoff  is  determined  by  comparing  the  utility  gain  from  extra  work 
u  (Cg)  -  0*  -  v (Cd)  with  the  value  of  extra  net  consumption  as  a  consequence  of 

work,  which  is  the  sum  of  marginal  product  and  the  change  in  consumption  which 
results  from  the  change  in  status 

U(C:)  -e*-v{Cd)  =  -  u/(Ca*)  (1-C>C^)  <10> 

The  allocation  determined  by  (9),  (10),  and  the  resource  constraint  (4b) 
differs  from  that  analyzed  in  Section  3  in  that  6*  is  determined  by  social 
needs  while  O^,  and  6^  are   determined  by  workers  to  equate  utilities  with  and 
without  work.   From  the  moral  hazard  condition  we  know  that  an  attempt  to 
implement  an  allocation  satisfying  (9)  results  in  no  one  choosing  to  work  when 
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disutility  is  not  observable.   Thus,  without  observability  it  is  optimal  to 
adjust  consumption  levels  away  from  the  equal  marginal  utility  condition  to 
induce  a  more  appropriate  labor  supply. 
5.   Two  Class  Model 

To  get  some  additional  feeling  for  the  nature  of  the  first  order 
conditions,  we  consider  a  case  where  there  are  only  two  types  in  the  economy, 
6i   and  63  (61  <  62)  with  population  weights  f^   and  £2   =  l-fi-   The  probability 
of  a  type— i  person  being  judged  disabled  is  p^,  with  P2  >  Pi-   We  assume  that 
the  optimum  has  the  form  that  the  S^  types  accept  disability  benefits  if 
judged  disabled  but  otherwise  work;  while  the  62  types  don't  work  whatever 
their  eligibility  for  disability  benefits.   For  a  logarithmic  example, 
sufficient  conditions  for  the  optimum  to  have  this  form  will  come  out  of 
analysis  of  that  special  case. 

The  social  welfare  optimization  now  takes  the  form 


Max   [(u(c3)    -  e^)  (1-Pi)    +v(Cd)Pi]fi 

+      [v(Cb)(l-P2)      +V(Cd)P2]f2 


(11a) 


subject   to 


(C^-1)    (l-Pi)fi    +    CdPifi    +    Cb(l-P2)f2    +    CdPzfs    =   R  <11^> 

v(Cd)  ^  u(c^)  -  e^  ^  v(Cb)  ^  u(cj  -  62  (lie) 

c,  ^  c,  (lid) 

The  constraints  in  (lie)  ensure  that  individual  behavior  coincides  with  that 
described  in  the  objective  function  and  resource  constraint.   One  can  check 
that  the  condition  u(c3)  -  Q^  i^  v(ct,)  and  the  moral  hazard  condition  (8)  imply 
that  at  the  optimum 
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u(cj  -  e,  =  v(Cb)  (12) 

Thus  the  optimum  is  described  by  the  resource  constraint  (lib) ,  the  equal 
utility  condition  (12)  and  a  single  first  order  condition  which  can  be 
obtained  by  differentiating  social  welfare  in  (11a)  with  respect  to  cj,  with 
Cg  and  Cb  functions  of  Cj  given  by  (lib)  and  (12)  .   Upon  differentiation  we 
have 

N=  {(l-p,)fX(Cb)  [v'(cj  -u/(cj] 

+  (1-P2)f2u'(cj  [v/(Cd)  -v/(c^)]}  (13) 

^^    =  N[p,f,  +  p^f^]  [v'(Cb)  (l-Pi)fi  +  u'(cj  (1-P2)fj-^ 


dCd 


The  moral  hazard  condition  implies  that  -= —  is  positive  at  c^j  =  Cj,.   Thus  the 

optimum  has  a  separate  disability  program.   Since  c^   does  not  affect  behavior 
at  the  margin,  v' (cj)  is   equal  to  the  Lagrangian  on  the  resource  constraint. 

Of  the  various  comparative  static  exercises  one  could  do,  we  restrict 
ourselves  to  one  simple  one.   Consider  a  set  of  different  economies  which 
happen  to  have  chosen  the  same  levels  of  Cg  and  Cf   Assume  that  the  different 
economies  have  different  levels  of  resources  R  and  different  levels  of  one  of 
the  variables  p^,  p2,  or  f^.   Then,  comparative  statics  is  done  by  examining 
the  equation,  N  =  0.   Differentiating  N,  and  evaluating  where  N  =  0,  we  have 

SN  <  0,^  <  0,4^  >  0,-S-    >    0  (14) 


dc^  5Pi     3P2     ^fi 

Thus,  across  these  different  economies,  those  with  a  more  discriminating 
disability  system  (p^  smaller  or  p2  larger)  or  a  smaller  population  with  high 
disability  have  larger  benefit  levels.   While  the  former  proposition  is 
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intuitive,  the  latter  rests  on  the  fact  that  for  a  program  increase  there  must 
be  decreases  in  both  wages  and  welfare  benefits  to  finance  the  change  and  this 
is  more  attractive  the  smaller  the  welfare  population. 
6.   Disability  Standard 

Relative  to  the  set  of  people  one  would  ideally  like  to  see  receiving 
disability  benefits,  any  attempt  to  evaluate  abilities  to  wcrk  will  be  subject 
to  two  types  of  error  —  admission  of  people  ideally  omitted  ..ad  exclusion  of 
people  ideally  included.^  In  this  section,  we  generalize  the  model  of 
Section  5  to  relate  the  probabilities  of  a  disability  finding,  p^  and  P2,  to  a 
disability  standard,  6*.   We  also  consider  the  first  order  condition  for 
optimization  of  0*.   In  the  next  section  we  will  extend  the  analysis  to  the 
continuous  case. 

Individuals  are  now  characterized  by  two  variables,  0  and  0g,  being 
true  and  estimated  disutilities  of  labor.   We  assume  that  the  distributions^" 
of  0g  for  the  two  types  are  Gi(0e)  and  02(85).   Since  observation  of  0g  is 
merely  a  device  for  making  inferences  about  6 ,  it  is  possible  to  define 
measured  disability  so  that  two  conditions  are  met  (see  Milgrom  (1979)). 
First  (which  is  implied  by  the  second;   the  assumption  that  p^  <  P2  for  any 
disability  standard  0* 

Pi  (0*)  =  1  -  G^  (6*)  <  1  -  G2  (6*)  =  P2  (6*)  (15) 

Second,  the  appropriate  definition  of  measured  disability  gives  '-.s  the 
monotone  likelihood  ratio  condition  for  all  0* 


^e  draw  on  the  analysis  of  auto  insurance  underwriting  by  Smallwood  (1975) . 

^°We  assume  that  G^  and  G2  have  positive  support  on  the  entire  positive 
half line. 
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gi(e')  <  93  (9*)  (16) 

gi(e')   g^O*) 

In  keeping  with  the  analysis  of  the  previous  section,  we  would  like  to 
exclude  all  type  ones  from  disability  benefits  while  including  all  type  twos. 
However  some  type  ones  are  included  for  any  finite  standard  and  some  type  twos 
are  excluded  for  any  positive  standard.   Thus  it  is  inevitable  that  some 
errors  of  classification  are  made.   The  problem  is  to  strike  the  appropriate 
balance  between  the  two  types  of  error. 

Continuing  to  assume  that  the  optimum  occurs  with  type  ones  accepting 
disability  but  not  welfare  benefits  while  type  twos  do  not  work,  we  have  the 
same  maximization  problem  as  in  (11)  with  p^  now  functions  of  the  control 
variable  6*.   Recalling  that  the  Lagrangian  is  equal  to  v'(Cj)  ,  the  first 
order  condition  for  6*  is 

[v(Cd)  -  v(Cb)]  (f^gi  +  f2g2(6*)) 

=  v/(cj  [(c^  +  1  -  cjf,g,(0*)  +  (c^  -  cjf2g2(e*)] 
Rearranging  terms  it  is  instructive  to  write  the  first  order  condition  as 

(18) 

=  [v'(Cd)  (Cd  +  1  -  cj  -  (v(cj  -vic^))]f^gJQ*) 

By  the  concavity  of  v,  the  left  side  is  positive  at  the  optimum.   Thus  the 
right  side  is  positive,  implying  that  l  -  c^  +  c^  is  positive;  this  also 
follows  from  the  presence  of  an  implicit  tax  on  work  as  part  of  the  optimal 
structure  which  was  argued  in  Section  3. 

Using  the  monotone  likelihood  ration  condition,  (16)  ,  one  can  draw 
comparative  statics  from  (18)  .   Let  us  consider  a  6*  and  a  set  of  economies 
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that  have  the  same  levels  of  benefits  and  wages,  the  same  g^  functions,  but 
different  levels  of  R  and  f.   Considering  (13)  and  (18),  this  can  only  happen 

where  — 5— i  is  independent  of  6  for  some  range.   Thus,  we  can  differentiate 

(18)  alone,  knowing  that  (13)  continues  to  be  satisfied.   By  implicit 
differentiation  of  (18),  we  see  that  0*  is  larger  the  greater  the  size  of  the 
type  one  disability  eligible  population. 
7.   Disability  Standard:   Continuous  Case 

In  a  similar  fashion,  we  can  introduce  a  choice  of  disability  standard 
into  the  continuous  case.   We  write  the  distribution  of  observed  disutility, 
6g,  conditional  on  true  disutility  being  6  as  0(8^;  6)  ,  with  density^^ 
gOg/O)  .   Then,  for  disability  standard  6*.  we  have 

p(e)  =  1  -  G(e*;e)  (i^) 

Returning  to  the  welfare  maximization  problem,  (3.4),  we  can  now  add  a 
further  first  order  condition  by  differentiating  with  respect  to  6*  (assuming 

e*  >  0) 

Ob 

[v(Cd)  -  u(cj  +  e]g(e*;e)dF(e) 


+  f  [v(Cd)  -  v(Cb)]g(e*;e)dF(e) 

Ob 
Ob 

=  A.[|  [Cd  -  c^  +  l]g(e*;e)dF(e) 

Od 

+  /  [Cd  -  cjg(e*;e)dF(e)] 


(20) 


^^For  convenience,  we  assume  that  g  is  positive  for  all  positive  values 
of  e„. 
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Thus  a  loosening  of  the  standard  attracts  some  people  to  the  disability 
program  from  the  labor  force  while  shifting  others  from  retirement.   The 
former  shift  has  a  net  social  value  v(C;j)  -  Xc^  -   {n{c^)    -  Q  -  'k(c^-x))    while 
the  latter  has  a  social  value  v(Cj)  -  Ac^  -  (v(Cb)  -  Xc^)  .      We  write  these 
differences  as  A^  +  6  and  A^,  respectively.   Across  the  set  of  people  shifted, 
these  net  gains  must  balance  out  for  an  optimum.   That  is,  the  first  order 
condition  (20)  can  be  written  as 

Afc  J  g(e*;e)dF(e)  +  a^  J  g(e';e)dF(e)  +  f  eg(e*;e)dF{e)  =o     (21) 

From  the  concavity  of  v  and  the  fact  that  v'(c^)  ^  X  we  have 

v(Cd)  -  v{Cb)  >  v'(Cd)  (Cd  -  Cfa) 

Thus,  at  the  optimum,  with  a  decrease  in  disability  standards,  social  welfare 
goes  up  from  additional  nonworkers  added  to  the  disability  rolls  (A^,  >  0) 
while  it  goes  down  from  at  least  some  of  the  workers  switching  to  disability 
iK  <   0)  • 

8.   Multiple  Disability  Standards 

If  individuals  with  different  health  problems  (e.g.  physical  vs.  mental, 
or  sight,  lower  back,  etc.)  have  different  distributions  of  actual  and 
observed  disutilities,  we  can  think  of  the  population  as  made  up  of  the  sum  of 
distributions  Fi(6)  with  conditional  distributions  of  observed  disutilities 
Gi(0q;6)  .   With  the  same  benefits  for  all  recipients  (as  with  the  US  disabili- 
ty program)  and  the  same  Lagrangian  for  the  single  government  budget  con- 
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straint,  the  first  order  conditions  (20)  tell  us  how  the  standard  61    should 

vary  with  distribution  GjOg/G)  ,  while  the  other  first  order  conditions  are 

modified  by  adding  over  different  health  problems.   In  (21),  61,    F^ ,  and  g^ 

are  the  only  elements  to  vary  with  respect  to  i. 

To  proceed,  we  now  assume  the  analog  of  (18),  which  Milgrom  (1979)  calls 
the  local  monotone  ratio  property 

^log  J-g.  (e.;9)  >0  (23) 

Unlike  the  two-class  case,  this  is  an  assumption  on  the  joint  distribution  of 
6g  and  6  rather  than  following  from  a  suitable  definition  of  6g.  Given  (23), 
the  left— hand  side  of  (21)  is  increasing  in  6*.  It  is  natural  to  ask  how  the 
standard  should  vary  with  the  distribution.  That  is,  across  different  health 
problems,  how  does  6*  vary  with  F^  and  g^  in  equation  (21).   We  shall  examine 

the  special  case  where  the  distributions  of  disutilities  are  the  same, 
Fi(6)  =  F2(e) ,  but  the  conditional  distributions  G^  are  different.   A  simple 
rightward  shift  for  the  difference  between  distributions  is  obviously  offset 
by  a  comparable  shift  in  6*.   The  interesting  case  to  analyze  would  be  for  a 
difference  in  the  shape  of  the  distributions. 
9.   Losarithmic  Example 

This  section  explores  in  more  detail  the  two  class  model  of  Section  5 
with  a  specification  of  logarithmic  utility  for  workers  and  non— workers . 
There  are  six  possible  configurations  of  the  economy.   Workers  of  each  tjrpe 
might  either  work  (case  I),  apply  for  disability  and  work  if  it  is  not  granted 
(case  II) ,  or  apply  for  disability  and  retire  if  it  is  not  granted  (case  III) . 
Since  disutility  of  work  is  greater  for  type  2  workers  than  for  type  1  workers 
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{Q^   <  62)  ,  there  are  six  possible  configurations  as  follows. 

Configuration  l-CI.I).  Both  types  work  and  do  not  apply  for  disability.  The 
resource  constraint  implies  C^  =  R  +  1  ,  social  welfare; 

Wi  =  iog(R  +  i)  -  e^fi  -  e^fs 

Configuration  2-(I.II).  Tjrpe  1  works,  type  2  applies  for  disability  and  works 
if  it  is  not  granted.   The  incentive  compatibility  constraints  are 

log  (Cb)  ^  log  (c^)  -  B^  ^   log  (c^)  ^  log  (c^)  -  Q^  (24) 

From  (24) ,  the  marginal  utility  for  persons  on  disability  is  higher  than  for 
those  working.  The  social  planner  thus  wants  to  make  c^  as  high  as  possible, 
consistent  with  (24) .   Hence 


log  (c^)  =  log  (c^)  -  6^  or   c^  =  c^e 


-9i 


(25) 


The  resource  constraint  is 

(C^  -  1)  fl  +  CdPsfz  +  (Ca  -  1)  (1  -P2)  f 2  =  R 
Solving  (9.2)  and  (9.3)  gives 


'^•'   ,-«. 


R  +  1  -  p.f 


2-^2 


e  ^Ppf,  +  1  -  Psf 


2-^2 


(26) 


(27) 


Social  welfare  is 

W2=(log(cJ    -  0i)fi  +  log(Cd)P2f2  +   (log(cJ    -e2)(l-P2)f2 
Using   (25)    and   (27)    gives 


W,  =  log 


/ 


R  +  1   -  Pjf 


2-^2 


-e. 


-  e,(f,  -Hp^f^)  -  e2(i-P2)f, 


[e    ^p^fa  +  1  -  P2f2J 
Configuration  3-(I.III).  Type  1  works,  type  2  applies  for  disability  and 
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retires  if  it  is  not  granted.   The  incentive  compatibility  constraints  are 

log  (c^;  -  62  ^  log  (Cb)  ^  log  (c^)  ^  log  (cj  -  6^       (28) 

From  (28)  the  marginal  utility  of  persons  on  retirement  or  disability  is 
greater  than  that  of  workers.   Thus  the  social  planner  sets  c^  and  c^   as  high 
as  possible  consistent  with  (28): 


log(Cb)  =  log(Cd)  =  log  (c^)  -  B^    or  c^,  =  c^  =  c^e"®^ 


(29) 


The  resource  constraint  is 

(c^-1)-:,  +  c^p^fs  +  Cb(l-P2)f2  =  R  (30) 

Solving  (9.6)  and  (9.7)  gives 


R  +  f, 
C.  =  ^-^—  (31) 


a 


Social  welfare  is 


W3  =  (log  (c^)  -  61)  fi  +  log  (Cd)P2f2  +  log  (^o'  (1  "P2)f2 
Using  (9.6)  and  (9.8)  gives 


W,  =  log 


V  ■'-2^     ~i 


Configuration  4-(II.II).   Both   oes  apply  for  disability  and  work  if  it  is 
not  granted.   The  incentive  compatibility  constraints  are 

log  (Cb)  ^  log  (c^)  -  63  ^  log  (c^)  -  6^  ^  log  (c^) 

The  social  planner  equalizes  marginal  utility  for  workers  and  disabled  persons 
by  setting 

Ca  =  C,  (32) 
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The  resource  constraint  is 


(C^-1)    ((1-P,)fl     +      (1-P2)f2)      +    Cd(Pifi+P2f2)      =R  (33) 

Solving  (32)  and  (33)  gives 

C^  =  R  -H  1  -  p,fi  -  p^f^  (3^) 

Social  welfare  is 

W^  =  (log(c^)  -ej  (l-pi)fi  +  log(Cd)Pifi+  (log(c^)  -Oj)  (l-p2)f2  +  log  (c<j)p2f2 
Combining  (32),  (34),  and  (35)  gives 

W4  =  log  (R  +  1  -  p^fi  -  P2f2)  -  Si  (1  -  p^)  f^  -  62  (1  -  P2)  fz 

Configuration  5— (II. III).   Both  types  apply  for  disability.   If  disability  is 
not  granted,  type  I's  work  while  type  2's  retire.   This  is  the  configuration 
analyzed  in  Section  5.   With  u(x)  and  v(x)  logarithmic,  solving  (lib),  (12), 
and  (13)  gives 


C^  =  (R  +  (1  -  p,)f,) 


(1-P,)f,  -H  (1-P2)f2 

(l-Pi)fi  +  e"®Ml-P2)f2 


Cb  =  c^e 


-61 


Cd  =  R  +  (l-Pi)fi 


(36) 

(37) 
(38) 


W5  =  log(R  +  (l-pjf  J  + 


log  (i-Pi)fi-_(i-P2)£2  _  e^ 


-e, 


(l-Pi)fi  +  e-''Ml-P2)f 


[(1-P,)fl  +  (1-P2)f2)] 


Configuration  6-(III.III).   Both  types  apply  for  disability  and  retire  if  it 
is  not  granted.   The  incentive  compatibility  constraints  are 

log  (c^)  -  62  ^  log  (c^)  -  6^  ^  log  (Cj,)  ^  log  (c^) 
Setting  C)3  =  Cjj  equalizes  marginal  utility  for  persons  on  retirement  and 


20 
persons  on  disability.   From  the  resource  constraint  C^j  =  Cj,  =  R  ;  social 

welfare  is  Wg  =  log  (R)  . 

Figures  2—6  present  social  welfare  and  benefit  levels  under  the  optimal 
program  for  parameter  variations  around  a  base  setting  of 

e^  =  1,  02  =  4,  fi  =  fj  =  .5,  Pi  =  .25,P2  =  .7  5,  and  R  =  .45.12 

Figure  2  shows  how  different  configurations  are  optimal  for  different 
levels  of  R.   Note  that  configuration  4,  in  which  both  types  app^y  for 
disability  and  work  if  it  is  not  granted,  is  not  optimal  for  any  value  of  R, 
given  the  above  base  settings  for  the  oth  r  parameters.   The  difference 
between  pi  and  p2,  i.e.  the  effectiveness  of  disability  screening,  needs  to  be 
increased  significantly^''  to  generate  co.i  ..juration  4.   That  is  not 
surprising,  since  in  configuration  4  both  types  make  the  same  labor  supply 
choice  at  the  optimal  consumption  levels.   Disability  screening  has  to 
compensate  for  differences  in  disutility  of  work  in  order  to  have  this  pattern 
of  benefits  optimal. ^^   Social  welfare  is  continuous  in  resources,  although 
consumption  levels  are  not,  with  discontinuous  jump:-  when  the  configuration 


i^If  workers  were  free  to  choose  their  hours  h  given  wage  —  per  hour  and 

8 

e. 

disutility  of  work  -^    per  hour,  they  would  choose  h  so  as  to  maximize 

8 

logi  — j  -  h-^  .   The  6^^ '  s  7ere  chosen  so  that  type  1  would  want  to  work  8  hours 

while  type  2  would  want  t,-  work  2  hours.   The  base  setting  for  F,  was  chosen  so 
as  to  be  in  the  middle  of  the  range  of  R  for  which  configuration  5  is  optimal 
(see  Figure  2) . 

i^For  example,  there  is  configuration  4  with  Pi  <  .08  for  P2  =  -75  at  the 
base  setting  of  the  other  parameters. 

^^Configurations  1  and  6,  which  also  involve  symmetric  labor  supply  choices, 
can  always  be  generated  by  sufficiently  extreme  values  of  R. 
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changes.   As  resources  increase,  avoiding  the  disutility  of  work  becomes  more 
attractive.   Each  configuration  change  is  an  increase  in  the  number  of  people 
not  working  financed  in  part  from  a  fall  in  wages .   Presumably  the 
discontinuity  would  not  be  present  with  a  continuous  distribution  of  types. 

Figure  3  shows  the  effects  of  varying  f^  given  the  base  settings  for  the 
other  parameters.   If  f^  is  less  than  .05,  configuration  3  is  optimal; 
otherwise,  configuration  5  is  optimal,  as  is  the  case  for  the  range  shown. 
From  (38)  one  can  verify  that  c^  is  linear  in  f^  in  configuration  5.   From 
(37)  one  sees  that  c^,  is  a  constant  fraction  of  c^.   It  is  interesting  that  c^ 
and  Cjj  are  not  monotone  in  f^,  although  social  welfare  rises  as  f^  increases, 
since  a  rise  in  f^  means  that  the  share  of  workers  with  low  disutility  has 
increased. 

Figures  4—6  show  the  effects  of  variations  in  the  disability  screening 
parameters  p^^  and  p2.   By  definition  P2  >  Pi-   Increases  in  p^,  holding  P2 
constant  at  the  base  setting  .75,  cause  a  fall  in  social  welfare  and  benefit 
levels  (see  Figure  4).   This  happens  because  a  larger  fraction  of  type  I's  are 
granted  disability,  hence  a  smaller  fraction  work,  and  there  is  less  income 
generated  in  the  economy.   If  p^^  is  pushed  into  the  range  .62  —  .75, 
configuration  5  is  no  longer  optimal;  instead,  configuration  3  is. 

An  increase  in  P2,  holding  p^  constant  at  the  base  setting  .25,  causes 
social  welfare  to  rise  (see  Figure  5).   An  increase  in  the  probability  of  type 
2  being  awarded  disability  means  that  a  larger  share  of  income  is  directed 
toward  type  2's.   Since  the  marginal  utility  of  income  is  greater  for  type 
2's,  social  welfare  rises.   Note  from  (38)  that  c^  is  independent  of  p2.   Thus 
Cg  and  Cb  fall  to  finance  the  increased  size  of  the  disability  program. 

Figure  6  shows  the  effect  of  varying  the  ratio  P2/P1,  holding  the 
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fraction  awarded  disability,  p^f^  +  Pafa,  constant  at  the  base  level  of  .5. 
This  amounts  to  varying  the  efficiency  of  disability  screening  while  holding 
awards  constant.   As  the  screening  technology  improves  (Pa/Pi  increases),  the 
incentive  compatibility  constraint  weakens  and  hence  social  welfare  rises.   In 
the  limit  as  P2/P1  "^  «",  the  same  benefits  can  be  given  to  workers  of  type  1 
and  to  non— workers  of  type  2,  i.e.  C;j  =  Cg.   Note  that  Cj,  is  fixed  at  a 
constant  fraction  of  Cg  by  (37)  ,  but  Cj,  becomes  irrelevant  as  the  fraction  of 
type  2's  awarded  disability  approaches  1. 
10.   Concluding  Remarks 

We  have  examined  how  benefits  and  disability  standards  would  be  set  by  a 
very  knowledgeable  social  welfare  maximizing  government.   We  have  assumed  that 
it  is  costless  to  evaluate  disability  applicants.   That  is  obviously  wrong. 
If  there  were  a  social  cost  but  no  private  cost  to  applying,  the  social  cost 
would  be  paid  by  all  the  1-F(6j)  individuals  who  would  apply  for  benefits. 
By  introducing  a  fee  for  applying,  the  government  could  cut  down  the  number  of 
applicants.   The  value  of  such  a  fee  depends  on  the  distribution  of  disabili- 
ties among  individuals  dissuaded  from  applying.   Particularly  attractive  would 
be  a  fee  negatively  related  to  the  disutility  of  work.   The  absence  of  work 
for  a  prior  period  would  be  such  a  fee,  although  it  represents  a  social  cost 
for  those  who  are  rejected  and  for  whom  6  lies  between  6^  and  6^.      Purely 
financial  application  fees  avoid  some  of  the  social  cost,  but  possibly  not  all 
since  they  may  involve  undesirable  income  redistributions  (as  would  be 
apparent  with  wealth  as  well  as  disutility  differences  in  the  population) . 
More  important  is  the  distribution  of  disincentives  to  apply.   The  optimal 
application  fee  structure  would  be  an  interesting  extension  of  the  model. 
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